Document Logical Structure Analysis Based on Perceptive Cycles
Authors
Abstract
This paper describes a Neural Network (NN) approach for logical document structure extraction. In this NN architecture, called Transparent Neural Network (TNN), the document structure is spread across the layers, allowing the interpretation to be decomposed from the physical level (NN input) to the logical level (NN output). The intermediate layers represent successive interpretation steps. Each neuron is visible and associated with a logical element. Recognition proceeds by repeated perceptive cycles that propagate information through the layers. When the recognition rate is low, it is improved by error backpropagation, which corrects the input features or selects a better-suited input feature subset. Several feature subsets are created using a modified filter method. The first experiments, performed on scientific documents, are encouraging.
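The control flow described in the abstract (propagate, check the result, retry with another feature subset) can be illustrated with a minimal Python sketch. It is not the authors' TNN: the feature names, weights, labels, and the confidence threshold are all hypothetical, and the backpropagation-driven choice of the next feature subset is replaced here by simply trying the subsets in a fixed order.

```python
import math

def softmax(scores):
    """Normalize raw scores so each output reads as a confidence."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def forward(features, layers):
    """Propagate features through the layers (one weight matrix per layer)."""
    activation = features
    for weights in layers:
        scores = [sum(w * a for w, a in zip(row, activation)) for row in weights]
        activation = softmax(scores)
    return activation

def perceptive_cycles(block, feature_subsets, layers_by_subset, labels, threshold=0.6):
    """Run one cycle per feature subset until the output is confident enough.

    Assumption: the next subset is taken in list order; the paper instead
    uses error backpropagation to correct or pick the subset.
    Returns (label, confidence, cycle_number) for the best cycle seen.
    """
    best = None
    for cycle, subset in enumerate(feature_subsets, start=1):
        features = [block[name] for name in subset]
        output = forward(features, layers_by_subset[subset])
        confidence = max(output)
        label = labels[output.index(confidence)]
        if best is None or confidence > best[1]:
            best = (label, confidence, cycle)
        if confidence >= threshold:
            break  # recognition is good enough, stop cycling
    return best

# Hypothetical physical features of one text block (all values invented).
block = {"bold": 1.0, "font_size": 0.9, "centered": 1.0, "line_count": 0.1}
labels = ["title", "paragraph"]
feature_subsets = [("line_count",), ("bold", "font_size", "centered")]
layers_by_subset = {
    ("line_count",): [[[1.0], [1.0]]],  # uninformative weights
    ("bold", "font_size", "centered"): [[[2.0, 2.0, 2.0], [-2.0, -2.0, -2.0]]],
}

result = perceptive_cycles(block, feature_subsets, layers_by_subset, labels)
print(result)
```

In this toy run the first subset yields an ambiguous output, so a second cycle with a richer subset is triggered and settles on a label. Because every output position maps to a named logical element, each cycle's decision can be inspected directly, which is the sense in which the abstract calls the neurons visible.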
Similar resources
Structure Extraction in Printed Documents Using Neural Approaches
This paper addresses the problem of layout and logical structure extraction from document images. Two classes of approaches are first studied and discussed in general terms: data-driven and model-driven. In the latter, specific approaches such as rule-based systems or formal grammars are usually studied on highly stereotyped documents, where they give reasonable results, while in the former artificial neural netwo...
Réseau de neurones dynamique perceptif - Application à la reconnaissance de structures logiques de documents. (Dynamic and perceptive neural network applied to document logical structure recognition)
Logical structure extraction of documents remains a challenging problem due to their inherent complexity and the gap between the physical features extracted from the image and their corresponding logical interpretation. Most approaches in the literature are model-driven and are not generic enough to handle complex and noisy documents. They do not use intermediate inter...
Interest of perceptive vision for document structure analysis
This work addresses the problem of document image analysis, and more particularly the topic of document structure recognition in old, damaged and handwritten documents. The goal of this paper is to show the value of human perceptive vision for document analysis. We focus on two aspects of the model of perceptive vision: the perceptive cycle and visual attention. We present the key ...
Oil and Iran Regions Rural Economic Structure Alteration
Oil has gradually gained a predominant place in the national economy since 1950 and is now the main resource securing the country's financial needs. This research is based on two questions concerning the conflict, which has steadily intensified, between oil rent and traditional economic sectors such as agriculture and livestock rearing. The two questions are as follows: what ar...
Perzeptive Syntagmen für Bewegtbilddokumente
This thesis investigates the analysis and the formalised metadata mapping of moving data. To this end, a new syntagma for structuring document subparts in movies is developed. The dissertation focuses on syntagmata which for the first time consider subparts of a document below the shot level. Until now, a set of shots could only be syntagmatically classified, if they were spatially connected. ...